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, Abstract 

G ' We discuss the interbasin kinetics approximation for random walk on a complex land- 

c/3 . scape. We show that for a generic landscape the corresponding model of interbasin kinetics is 

equivalent to an ultrametric diffusion, generated by an ultrametric pseudodifferential opera- 
tor on the ultrametric space related to the tree of basins. The simplest example of ultrametric 
diffusion of this kind is described by the p~adic heat equation. 



§ '. 1 Introduction 

Q 

Dynamics of a broad class of complex systems (glasses, clusters, polymers) is described by a 

random walk on a complex landscape of energy P, [2], [3], [1]. Landscape is a real valued 
J^ . function (energy) on a domain in R^. Complex landscape is a function which possesses many 
lO '• local minima. In particular, models of this kind are important for description of protein dynamics 
"^ I in the relaxation approach [5j . Therefore approximations of dynamics on complex landscapes are 
important for applications. 

We discuss the random walk on the complex energy landscape, given by the real valued function 

t^^ . U{x) on R^, with the temperature T and the inverse temperature f3 = 1/kT {k is the Boltzmann 

constant). This random walk is defined as follows: the transition probability rate for transitions 

between the two neighbor infinitesimal vicinities Oi, O2 of the energy surface will be proportional 

/^ ' to the Boltzmannian factor exp(— /3At/), where Af/ is the energy difference for the sets Oi and O2. 

c^ ■ This formula is valid for transitions which increase energy, for transitions which decrease energy 

we put the Boltzmannian factor equal to one. 

For random walk under discussion the system will spend more time in the low energy areas of 
the energy landscape. Therefore we get the following picture — the system stays in the vicinities 
of local minima and performs transitions between local minima through the energy barriers. For a 
generic landscape local minima will be hierarchically clustered with respect to the energy barrier 
between the minima. 

These arguments suggest the approach of interbasin kinetics, which is the approximation of a 
dynamics on a complex landscape, based on the description of the kinetics of transitions between 
the groups of states, called basins. The minimal basins correspond to local minima of energy, the 
larger basins are hierarchical unions of smaller basins. 
The postulates of interbasin kinetics: 
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(1) The space of states is separated into basins, basins are separated into subbasins in a 
hierarchical way. 

(2) The activation energy barrier between the two states depends only on basins, containing 
these states and does not depend on the choice of states in these basins. 

Therefore the transitions between basins are described by the system of kinetic equations: 

|/(^, t) = -i: [Tit, J)f{t, t) - T{j, z)f{j, t)] u{j) (1) 

Here the indices i, j enumerate the states of the system (which correspond to the minimal basins, 
or local energy minima), T{i,j) > is the probability rate for transitions from i to j, v{j) > 
are positive numbers (volumes of the basins) . 

The described constraints of interbasin kinetics on the matrix T{i,j) imply that this matrix 
will be a block matrix with a large number of equal elements. The important example is the Parisi 
matrix T{i,j) (used in the theory of spin glasses, see [6]). 

Various models of interbasin kinetics and hierarchical dynamics were studied in many papers, 
see [7], |8], [9], |T0]. In papers |Tl], |I2] ]9-adic diffusion was discussed in relation to the relaxation 
of spin glasses. 

Protein dynamics was studied, with the help of the Mossbauer spectroscopy by H.Frauenfelder 
[1] and V.I.Goldansky [13]. Hierarchical approach to description of the space of state of a protein 
was proposed by H.Frauenfelder, see [1]. 

In [13] it was proposed the approach to describe the interbasin kinetics models with the help 
of ultrametric diffusion, generated by pseudodifferential operators. Namely, the postulates of 
interbasin kinetics are put into the form: 

(1) = the space of states is ultrametric 

(2) = transition probability rate is locally constant 

In the simplest case (when T{i,j) is the p-adic Parisi matrix of some simple form, all z/(j) 
are equal) the system of equations of interbasin kinetics takes the form [15] of the p-adic heat 
equation 

PXx,t) + D^f{x,t) = (2) 



Initially this equation was introduced in [16] from purely mathematical motivations. Here D° is the 
Vladimirov operator of p-adic fractional differentiation with respect to x. This parameter describes 
the tree of basins for the complex energy landscape, in the system of equation of interbasin kinetics 
([1]) the parameter x corresponds to the index i of the local minima. For the models of protein 
dynamics x is the conformational parameter. p-Adic models of interbasin kinetics were discussed 
in [15], [13], m- 

Procedures of construction of the hierarchy of basins and of models of interbasin kinetics 
starting from the energy landscape were studied by Stilinger and Weber [TS], [IH], Becker and 
Karplus [2D] . These models were applied to construction of hierarchy of basins for peptides [2U] 
using the data of molecular dynamics. Complex landscape in this approach is approximated by 
the disconnectivity graph and the function of energy barriers on this graph. 

In the present paper we construct the equation of ultrametric diffusion which describes the 
interbasin kinetics approximation for the dynamics on a complex landscape of a generic form. 



This equation has the form of the following ultrametric pseudodifferential equation 

^/(^, t) + / -T—^ ^ e^^(^V(x, t) - e^^(^)/(y, t)] du{y) = (3) 



dt ' Jx z/(sup(x, y)) 

Here x,y & X lie in the ultrametric space which describes the tree of basins for the landscape of 
energy, /(x, t) is the distribution of occupation. For a wide class of landscapes the above equations 
are exactly solvable. Thus the dynamics on complex landscapes in these cases can be investigated 
analytically. The important example of the above equation is the p-adic heat equation ([2]). 

The exposition of the present paper is as follows. 

In Section 2 we describe the procedure of construction of the tree of basins and the function 
of activation energy barriers for a generic landscape. 

In Section 3 we construct the corresponding general model of interbasin kinetics. 

In Section 4 we show the equivalence of the interbasin kinetics model of Section 3 and the 
model of ultrametric diffusion on the space corresponding to the tree of basins. 

In Section 5 we discuss the clustering procedure. 

In Section 6 we put some material on ultrametric analysis. 

2 Energy landscape and the tree of basins 

Let us describe the procedure which puts into correspondence to an energy landscape U (a smooth 
real valued function defined in a domain (or the configuration space) M C R^) the tree of basins, 
the function on this tree which describes the activation barriers for a random walk on the landscape, 
and the measure on the border of the tree of basins which describes volumes of the corresponding 
basins. 

Let us consider the set of all local minima of U. We assume that this set is finite. For the 
local minimum i consider the set R{i) in the configuration space M (the basin of attraction of i), 
which contains the points ^ G M, for which: 

1) There exists a path (i.e. a continuous curve) in the configuration space, which connects ^ 
and i, and the function U does not increase on the path from ^ to i. 

2) If there exist paths from ^ to several local minima, and the function U is non increasing 
along these paths, then the distance between ^ and i is less or equal than the distances between 
X and the other minima. Here the distance between ^ and i is understood as a distance along the 
surface of energy, i.e. the distance between two points of a landscape is the infimum of lengths of 
paths on the energy landscape, which connect the points. 

The different R{i), R{j) can intersect on the sets of measure zero. The union of all R{i) gives 
the whole configuration space. 

Put into correspondence to the basin R{i) the volume #(i): 



#(z) = f dx 

JR(i) 



Let us introduce the following notations. 

1) Assume that the points a, h are connected by the path S in the configuration space M. We 
say that the point a is separated from h by the energy barrier E at the path S", if the following 



supremum over the points ^ E S is equal to E: 

2) We say that the points a, h in the configuration space are separated by the energy barrier 
(or the activation barrier) E{a, b), if the infimum over the paths S from a to 6 in the configuration 
space of the energy barriers at the path S is equal to E{a, b): 

E{a,b) = inf 5 supgg5f/(0 

Let /3 be a positive number (the inverse temperature). Let us introduce on the set of local 
minima the metric 

For a generic landscape U this metric will satisfy the strong triangle inequality (i.e. will be an 
ultrametric). 

Let us fix the energy scale — the increasing sequence of real numbers {E^}, and the cor- 
responding sequence of positive numbers D = {dk}, dk = e~^^''. Consider the corresponding 
clustering Cd of the set of local minima with the distance d{-, ■) (see the Appendix 1). 

The clusters from Cd we will also call the basins. Let us call the directed tree T for the 
clustering Cd the disconnectivity graph of the landscape U. Using this tree we build the ultrametric 
space X{T) (see the Appendix 2). The points of this space correspond to the local minima of the 
energy landscape, the balls (with respect to the ultrametric) correspond to the basins. One can 
say that a point x corresponds to some local minimum i together with the set of inclusions of the 
corresponding basins which contain i. 

On the space X there exists the natural measure z/, such that the measure z/(a;) of the point 
X E X (the space X in the case under consideration consists of the finite number of points) is 
equal to the volume of the basin of attraction of the local minimum #(?). 

3 Our ansatz of interbasin kinetics 

Consider the tree of basins for the energy landscape, built with the help of the procedure of 
the previous section. Let us construct the system of equations of interbasin kinetics using the 
Arrhenius-Eyring formula, which gives the approximation for the velocity constant of reaction in 
chemical kinetics: 

K = Aexp{-(3AF) 

where k is the velocity constant of reaction, AF is the free energy of activation, A is some constant, 
P is the inverse temperature. Let us remind that the free energy of the group of states (with the 
same energy) is defined as 

F = E-9S 

where E is the energy of the group of states, 6 = j3^^ is the temperature, S is the entropy 
(logarithm of the number of states in the group). 

We consider the system of equations of interbasin kinetics of the form 



dg{i,t) 



dt ... 



J2 [e'3(^»-^(-P(^.^-)))c(,, j)^(^,t) _ e'3(^(^')-^(^"P(*'^'«)C(j,2)(7(j,t) 



Here i, j are minimal basins (which correspond to local minima), g{i) is the occupation of the 
minimal basin i, F{i) is the free energy of the basin i, sup{i,j) is the minimal superbasin which 
contains both the basins i and j, G(sup(i, j)) is the free energy of the transition state for transitions 
between i and j. 

This system of equations is based on the Eyring formula and the assumption that the transition 
state for transitions between i and j is defined by the superbasin sup(2, j)). 

We choose the coefficients C{i,j) to be positive and symmetric. In this case the above system 
of kinetic equations satisfies the conditions of detailed balance. These coefficients describe the 
modification of the Eyring formula on the case of transitions between the groups of states with 
the unique intermediate transition state. Choice of the coefficients C{i,j) fixes the model of 
interbasin kinetics. We propose the following ansatz for the coefficients: 

„,. .> #(»)#(j) ,.. 

Here #(«) is the number of states in the basin i (i.e. the volume of this basin). This choice satisfies 
the scaling conditions — the coefficients C{i,j) do not change with dilatations of the landscape. 
With this choice of the coefficients the system of equations of interbasin kinetics takes the form 

Here f{i) = g{i)/ij^{i) is the density of occupation of the basin i, E{i) is the energy of the basin i 
(i.e. e^''^^'^ = e^^/#(2)). We choose the volumes of the transition states for basins sup(i, j) to be 
proportional to the volumes of these basins: 

e^(^"P(^'^» ~#(sup(i,j) (5) 

where S'(sup(i, j)) is the entropy of the transition state. With this choice of entropy for transition 
states (we ignore the corresponding coefficient of proportionality) the system of equations of 
interbasin kinetics takes the form 

dfd f) p-l3E{snp{i,j)) 

^ = - E TT-j-v^ b ''f^'^ ^) - ^ ''^^^■' ^) #^^') (6) 

dt ~Z^#(sup(2,j)) L J 

Here E{sup{i,j)) is the energy of the transition state for the basin sup(i, j), and this value coincides 
with the energy used in the clustering procedure of construction of the tree of basins. 

Therefore the introduced here ansatz of interbasin kinetics is based on the clustering procedure 
of construction of the tree of basins, the Arrhenius-Eyring formula and conditions (jlj), ([5]), and 
generates the system of equations (Q . 

4 Ultrametric diffusion 

In the present section we show that the system of equations of interbasin kinetics is equivalent to 
the dynamics on the ultrametric space X corresponding to the tree of basins. 
We have the following theorem. 



Theorem 1 The system of equations of interbasin kinetics (0) is equivalent to the ultranietric 

pseudodifferential equation 

where the ultrametric space X corresponds to the tree of basins of the energy landscape, the 
points X of the ultrametric space correspond to the minimal basins i (basins of attraction of local 
minima), the measure v describes volumes of the basins (i.e. for the minimal basin i corresponding 
to the minimal ball x we have z/(x) = i^{i)). 

We will not restrict the consideration of the dynamics on energy landscapes to ultranietric 
spaces containing finite number of points, but instead we will consider the general case of equations 
of the form ([7]). Finite trees of basins are obtained because we consider smooth energy landscapes. 
In reality energy landscapes can be complex and rugged. For a rugged energy landscape the 
described procedure is not directly applicable. Instead we can consider the inductive limit of 
directed trees and related spaces of functions. We investigate the pseudodifferential equation of 
the form ([7]) on the ultrametric space corresponding to the limiting infinite tree, and interpret this 
equation as describing dynamics on a rugged landscape. 

Example Consider the case when X = Qp, the measure z/ is the Haar measure fi, and the 
activation energy is chosen as follows: 

E{\x — y\p) = k\n\x — y\p, k > 0, 

The potential of the minimal basins (point in Qp) is equal to zero. In the notation \x — y\p = p'^, 
the activation energy is linear with respect to 7. We get for the transition probability rate the 
expression 

^-f3E{s\ip{x,y)) ^-I3kln\x-y\p ^ 



u{sup{x,y)) \x-y\p \x - y\l+'^'' 

Equation of interbasin kinetics takes the form of the p-adic heat equation 



where the parameter a of the Vladimirov operator of the p-adic fractional differentiation 

T^arf ,x r-i/ N f f{x,t)- f{y,t) 
DJ{x,t) = Tp (-a) / — , _ ,^^^ — dfi{x) 

jQp \x y\p 

is proportional to the inverse temperature: a = pk. 



Remark Cauchy problem for the p-adic heat equation is exactly solvable. Analogously, 
Cauchy problem for equation([7]) is exactly solvable (with the help of the ultrametric wavelet 
transform) if the energies of local minima are equal: E{x) = const. Therefore in the interbasin 
kinetics approximation the dynamics for a wide class of complex energy landscapes possesses 
analytical investigation. 
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Example: Mb— CO rebinding One of the most important applications of the dynamics 
on energy landscapes and interbasin kinetics is the application to conformational dynamics of 
proteins. In this case the ultrametric parameter x describes the conformational coordinate for the 
protein. 

In paper [13] it was shown that the obtained with the help of p-adic methods results on 
protein dynamics coincide with the data of spectroscopic experiments for Mb-CO rebinding. Mb- 
CO rebinding is a fundamental model in the physics of proteins and plays the role of "the hydrogen 
atom of biology" [2j . 

Let us describe the approach of [1^. Myoglobin can bind CO only when myoglobin is in 

some particular subset of the space of conformations (when the path to the active center of the 

molecule is opened). Consider the model of Mb-CO rebinding described by the equation of 

interbasin kinetics 

~d_ 

dt 



- + D: + n{\x\,) 



f{x,t) = (8) 



Here a is proportional to the inverse temperature (3, the conformational coordinate is parameter- 
ized by the field of p-adic mumbers. The function f{x, t) is the density of occupation of the space 
of conformations for molecules of myoglobin (not bound to CO). The Mb-CO binding takes place 
on the subset of the space of conformations described by the unit ball in Qp. 
Equation (jHj) is a model of ultrametric diffusion with a sink. 

Remark In the model of Mb-CO rebinding ([8]) we get the generator of diffusion with a sink 
in the form of the p-adic Schrodinger operator 

D: + ni\x\p). 

The term with positive potential describes a sink (negative potential will describe a source). The 
Vladimirov operator D" plays the role of a p-adic Laplacian. 

The operator in the RHS of the equation ([7]) have the form of the product of operators 

Df{x) = / ' , .. k^(^V(a:) - e^^(^)/(i/)] duiy) = TXf{x), 
Jx i^ sup Uc,y L -I 



where 



^~f3E{sup{x,y)) 



^^<^' = /v^;r;(;^i«^)-^'^'1'"'<^'- 



is the ultrametric pseudodifferential operator and 

Xf{x) = e^^(")/(a;) 

is the operator of multiplication by the exponent of the potential. 

Therefore in applications of ultrametric analysis to models of interbasin kinetics we get the 
Schrodinger operator (a sum of a pseudodifferential operator and an operator of multiplication by 
a function), and a product of a pseudodifferential operator and an operator of multiplication by a 
positive function. 



5 Appendix 1: Clustering 

In the present section we discuss the clustering procedure for metric spaces. Denote (M, p) the 
metric space M with metric p. 

Definition 2 A sequence of points a = xo,xi, . . . , x„_i, x„ = b in the metric space (M, p) is called 
an e-chain connecting a and b, if p{xk-,Xk+i) < £ for all < k < n. If there exists an e-chain 
connecting a and b, we say that a and b are e-connected. 

In an ultrametric space any two points a b are not e-connected for e < p{a, b). 

Definition 3 Let (M, p) be an arbitrary metric space. Let us define the chain distance 

d{a, b) = inf {e : a,b are e -connected) . 

The chain distance d{a, b) between the points a and b satisfies all the properties of ultrametric 
except for nondegeneracy, i.e. 

d{a, b) = d{b, a) Va, b, 

d{a, b) < max {d{a, c), (i(c, b)) Va, 6, c, 

but it is possible that d{a, 6) = for some a ^ b. 

If the space M is ultrametric (i.e. p satisfies the strong triangle inequality), then the chain 
distance d{-, ■) will coincide with the ultrametric p(-, ■). 

Definition 4 Let us call the cluster C{i,R) in the metric space {M,p) the ball with respect to 
the chain distance with the center in i and the radius R, i.e. the set {j & M : d{i,j) < R}. The 
clustering of the space M is the set of clusters in M, such that any element of M lies in some 
cluster. 

By this definition the set of clusterings is partially ordered: assume we have two clusterings A 
and B of the set S, then ^ > i3, if all clusters of B are subsets of clusters of A. 

Since the chain distance satisfies the strong triangle inequality, any clustering C generates a 
directed tree of clusters T = T[M] and an ultrametric on this tree (the chain distance between 
clusters). Then using the standard procedure (see the Appendix 2) we construct the ultrametric 
space X = X{T) (the chain space of the clustering C), which can be identified with the border of 
the tree T. Clusters in the metric space M correspond to balls in the ultrametric space X. 

Example Consider the important example of clustering. Let D = {di} be a finite or countable 
set of positive numbers without positive accumulation points. Consider the clustering Co of the 
metric space (M , p) which contains all clusters of chain radii d^ & D and arbitrary centers. 

6 Appendix 2: Ultrametric analysis 

In this Section we discuss some results on ultrametric analysis, which can be found in [21], |22j . 

ESI. 



Definition 5 An ultranietric space is a metric space with the ultranietric d{x, y) (where d{x, y) 
is called the distance between x and y), i.e. a function of two variables, satisfying the properties 
of positivity and non degeneracy 

d{x, y) > 0, d{x, y) = =^ x = y; 

symmetricity 

d{x,y) = d{y,x); 

and the strong triangle inequality 

d{x, y) < max((i(a;, z), d{y, z)), Wx, y, z. 

We say that an ultrametric space X is regular, if tliis space satisfies tlie following properties: 

1) The set of all the balls of nonzero diameter in X is finite or countable; 

2) For any decreasing sequence of balls {D'^''^}, D^''^ D D^''^^\ the diameters of the balls tend 
to zero; 

3) Any ball of non-zero diameter is a finite union of maximal subballs. 

Ultrametric spaces are dual to directed trees. Below we describe some part of the duality 
construction. 

For a regular ultrametric space X consider the set T{X), which contains all the balls in X of 
nonzero diameters, and the balls of zero diameter which are maximal subbals in balls of nonzero 
diameters. This set possesses a natural structure of a directed tree. Two vertices / and J in T{X) 
are connected by an edge if the corresponding balls are ordered by inclusion, say I D J (i.e. one 
of the balls contain the other), and there are no intermediate balls between / and J. 

The partial order in T{X) is defined by inclusion of balls, this partial order is a direction. We 
recall that a partially ordered set is a directed set (and a partial order is a direction), if for any 
pair of elements there exists the unique supremum with respect to the partial order. 

On the directed tree T{X) we have the natural increasing positive function which puts into 
correspondence to any vertex the diameter of the corresponding ball. 

Assume now we have a directed tree T with the positive increasing function F on this tree. 
Then we define the ultrametric on the set of vertices of the tree as follows: d{I, J) = F(sup(/, J)) 
where sup(/, J) is the supremum of vertices /, J with respect to the direction. 

Then we take completion of the set of vertices with respect to the defined ultrametric and 
eliminate from the completion all the inner points of the tree (a vertex of the tree is inner if it 
does not belong to the border of the tree). We denote the obtained space X(T), this space is 
ultrametric. 

An ultrametric pseudodifferential operator is defined in the following way. Consider a a- 
additive Borel measure u with countable or finite basis on a regular ultrametric space X. Consider 
the pseudodifferential operator 



Tf{x) = J T{snp{x,y)){f{x) - f{y))du{y) 

Here T{I) is some complex valued function on the tree T{X). The supremum 

sup{x,y) = I 
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of the points x, y G X is the miniinal ball / in X, containing both points. 
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